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IN OR RELATING TO SWITCHING DEVICES 



The present invention relates to improvements in or relating to 
switching devices, and is more particularly concerned with a system for 

5 providing distributed schedules for such a device. In particular, the 

invention relates to the distribution of control between a management card 
and line interface cards (LICs) on a switching device. For the purposes of 
the following description the term switching device refers to any device 
which performs the function of a circuit switch, packet switch or a router. 

10 Data is transferred over the Internet by means of a plurality of 

routing devices in accordance with a standard protocol known as Internet 
Protocol (IP). IP is a protocol based on the transfer of data in variable 
sized portions known as packets. All network traffic involves the 
transportation of packets of data. Routers are devices for accepting 

1 5 incoming packets; temporarily storing each packet; and then forwarding the 
packets to another part of the network. 

Traffic volume in the Internet is growing exponentially, almost 
doubling every 3 months, and the capacity of conventional IP routers is 
insufficient to meet this demand. There is thus an urgent requirement for 

20 products that can route IP traffic at extremely large aggregate bandwidths 
in the order of several terabits per second. Such routing devices are termed 
"terabit routers". 

Terabit routers require a scalable high capacity communications path 
between the point at which packets arrive at the router (the "ingress") and 
25 the point at which the packets leave the router (the "egress"). 

The packets transferred in accordance with IP can (and do) vary in 
size. Within routers it has been found useful to pass data in fixed sized 
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units. In routers, the data packets are partitioned into small fixed sized 
units, known as cells. 

One suitable technique for implementing a scalable communications 
path is a backplane device, known as a cell based cross-bar. Data packets 
5 are partitioned into cells by a plurality of ingress means for passage across 
the cross-bar. 

The plurality of ingress means provide respective interfaces between 

incoming communications channels carrying incoming data and the 

backplane device. Similarly a plurality of egress means provide respective 
10 interfaces between the backplane device and outgoing communications 

channels carrying outgoing data. 

A general terabit router architecture bears some similarity to 

conventional router architecture. Packets of data arrive at input port(s) of 

ingress means and are routed as cells across the cross-bar to a 
1 5 predetermined egress means which reassembles the packets and transmits 

them across its output port(s). Each ingress means maintains a separate 

packet queue for each egress means. 

The ingress and egress means may be implemented as line interface 

cards (LICs). Since one of the functions regularly undertaken by the 
20 ingress and egress means is forwarding, LICs may also be known as 

'forwarders'. Further functions include congestion control and 

maintenance of external interfaces, input ports and output ports. 

In a conventional cell based cross-bar each ingress means is 

connected to one or more of the egress means. However, each ingress 
25 means is only capable of connecting to one egress means at any one time. 

Likewise, each egress means is only capable of connecting to one ingress 

means at a time. 
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All ingress means transmit in parallel and independently across the 
cross-bar. Furthermore cell transmission is synchronised with a cell cycle, 
having a period of, for example, 108.8ns. 

The ingress means simultaneously each transmit a new cell with each 
5 new cell cycle. 

The pattern of transmissions from the ingress means across the cross- 
bar to the egress means changes at the end of every cell cycle. 

The co-ordination of the transmission and reception of cells is 
performed by a cross-bar controller. 
10 A cross-bar controller is provided for efficient allocation of the 

bandwidth across the cross-bar. It calculates the rates that each ingress 
means must transmit to each egress means. This is the same as the rate at 
which data must be transmitted from each packet queue. The calculation 
makes use of real-time information, including traffic measurements and 
15 indications from the ingress means. The indications from the ingress 

means include monitoring the current rates, queue lengths and buffer full 
flags. The details of the calculation are discussed more rigorously in the 
copending UK Patent Application Number 99073 1 3.2 (docket number 
F21558/98P4863). 

20 The cross-bar controller performs a further task; it serves to schedule 

the transfer of data efficiently across the cross-bar whilst maintaining the 
calculated rates. At the end of each cell cycle, the cross-bar controller 
communicates with the ingress and egress means as follows. Firstly, the 
cross-bar controller calculates and transmits to each ingress means the 

25 identity of the next packet queue from which to transmit. Secondly, the 
cross-bar controller calculates and transmits to each egress means the 
identity of the ingress from which it must receive. 
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The system described above does have a number of disadvantages 
however. The cross-bar controller is responsible for controlling the cell 
cycle-by-cell cycle behaviour of each ingress and egress means. At the 
rates required by a terabit router, this amounts to demanding complex 
5 hardware to implement the cross-bar controller, the ingress and the egress 
means. Furthermore the demand for higher capacity places stringent delay 
performance conditions upon the communication channels between the 
ingress and egress means and the cross-bar controller means. 

When developing a system for particular traffic conditions, it is 
10 disadvantageous to have to replace inappropriate hardware. 

It is therefore an object of the invention to obviate or at least mitigate 
the aforementioned problems. 

In accordance with one aspect of the present invention, there is 
provided a switching arrangement having:- 
15 a cross-bar; 

a plurality of ingress means connected to an input side of the cross- 
bar, each ingress means including an ingress schedule storing means; 

a plurality of egress means connected to an output side of the cross- 
bar, each egress means including an egress schedule storing means; and 
20 a management card which communicates configuration primitives to 

each of the plurality of ingress means and to each of the plurality of egress 
means, the configuration primitives providing updated entries for ingress 
and egress schedule storing means. 

It is preferred that each ingress means includes means for storing a 
25 plurality of transmission queues for transmission across the cross-bar. 

Advantageously, each ingress schedule storing means stores identities of 
said transmission queues, each transmission queue corresponding to a 
respective egress means identification number. 




-5- 

Preferably, each ingress means maintains a pointer into each ingress 
schedule storing means for identifying the transmission queue to be 
transmitted. 

It is also preferred that each egress schedule storing means stores 
5 identities of ingress means addresses from which data is to be received. 
Advantageously, each egress means maintains a pointer into each egress 
schedule storing means for identifying an ingress means address from 
which data is to be received. 

In accordance with another aspect of the present invention, there is 
10 provided a method of routing data using a switching arrangement as 
described above, the method comprising :- 

a) storing a plurality of transmission queue identities in each 
ingress schedule storing means; 

b) storing a plurality of ingress identities in each egress schedule 
1 5 storing means; 

c) managing the contents of each ingress schedule storing means 
and each egress schedule storing means by providing ingress pointer means 
to reference one of said stored plurality of transmission queue identities and 
egress pointer means to reference one of said stored plurality of ingress 

20 identities from which data is to be received; and 

d) at each cell transmit time, transmitting a cell from said 
referenced transmission queue in the ingress means and receiving the cell 
from said referenced ingress identity. 

Preferably, step d) further comprises moving said ingress pointer and 
25 said egress pointer to the next location. 

It is preferred that step c) comprises, for each ingress means, 
calculating cross-bar rates required to each egress means. Said cross-bar 
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rates may be calculated according to current traffic load and quality of 
service required. 

Additionally, step c) further comprises calculating corresponding 
ingress and egress schedules which satisfy said calculated cross-bar rates. 
5 The method further comprises the step of updating the ingress and egress 
schedule storing means with update messages relating to the calculated 
ingress and egress schedules. 

The present invention provides a routing device having a plurality of 
ingress line function means, a plurality of egress line function means, a 

10 backplane and a management card; each ingress line function means 
having: a schedule storing means, for storing a schedule of egress line 
function means addresses; a pointer storing means, for storing a pointer to 
an address held in the schedule storing means; and a queue storing means, 
for storing a plurality of ingress queues of cells for transmission across the 

1 5 backplane, each one of the plurality of ingress queues corresponding 

uniquely to a predetermined one of the egress line function means; wherein 
the management card communicates configuration primitives to each of the 
plurality of ingress line function means and to each of the plurality of 
egress line function means, the configuration primitives providing updated 

20 entries for the schedule. 

The ingress line function means may be line interface cards. 
Likewise the egress line function means may also be line interface cards. 

As a result of the present invention, simple schedules for cell 
transmission and reception across the cross-bar are sent to each LIC by the 

25 controller. The controller is responsible for formulating the schedules to 
ensure that traffic and quality of service requirements are met. The LICs 
merely have to obey the schedules, a simple task. The principle benefits 
are reduced complexity of the LICs, and flexibility. New services can be 
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added to the router by software download without impact on the LIC 
hardware. 

For a better understanding of the present invention, reference will 
now be made, by way of example only, to the accompanying drawings in 
5 which:- 

Figure 1 illustrates a terabit router architecture; 
Figure 2 shows a cross-bar controller; and 

Figure 3 shows a system for distributing schedules according to the 
present invention. 

10 Although the present invention will be described with reference to 

ingress and egress forwarders, it will readily be understood that the present 
invention is not limited to the use of such forwarders. Any suitable line 
function means can be used for ingress and egress, for example, line 
interface cards (LICs). 

15 Figure 1 illustrates a conventional terabit router architecture 100 in 

which packets arrive at ingress forwarders 102, 104, 106 via their input 
port(s) (not shown) and are routed across a cross-bar 1 10 to a correct egress 
forwarder 120 which transmits them across its output port(s) (not shown). 
Each ingress forwarder 102, 104, 106 maintains a separate packet queue for 

20 each egress forwarder 120. 

Ingress forwarder 102 has three queues qn, qi2, qn of data packets 
ready for transfer to three separate egress forwarders, only one of which is 
shown as 120, via the cross-bar 1 10. Similarly, three queues q 2 i, qn, q23 
and q 3J , q 32 , q 3 3 are formed respectively in each of the ingress forwarders 

25 104, 106. Although three queues are shown in each ingress forwarder 102, 
it will be appreciated that any number of queues can be present in each 
ingress forwarder 102, 104, 106, each queue corresponding to an egress 
means. 
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It will be appreciated that although only one egress forwarder 120 is 
shown in Figure 1 , the number of egress forwarders will be the same as the 
number of ingress forwarders. 

By way of explanation, a cell based cross-bar is characterised as 
5 follows: 

a) Each ingress line function may be connected to any egress line 
functions. 

b) Each ingress line function may only be connected to one 
egress line function at a time. 

10 c) Each egress line function may only be connected to one 

ingress line function at a time. 

d) All ingresses transmit in parallel across the cross-bar. 

e) Data is transmitted across the cross-bar in small fixed sized 
cells, for example, a cell size is typically 64 octets. 

15 f) Cell transmission is synchronised across all the ingress line 

functions. This means that for each cell cycle, each ingress line function 
starts transmitting the next cell at the same time. 

g) The cross-bar is reconfigured at the end of every cell cycle. 
As shown in Figure 1, packets of data arriving at the ingress 

20 forwarders 102, 104, 106 via their input port(s) (not shown) and are routed 
across the cross-bar 1 10 to the correct egress forwarders 120 which 
transmits them across its output port(s) (also not shown). Each ingress 
forwarder 102, 104, 106 maintains a separate packet queue for each egress 
forwarder 120, for example, q n , q 12 , qn, q2i> q22, q23> q3i, q32> q33- 

25 A conventional cell based cross-bar arrangement 200 is shown in 

Figure 2. The arrangement 200 comprises a plurality of ingress forwarders 
210 and a plurality of egress forwarders 220 connected to a cross-bar or 
backplane 230. Here, each ingress forwarder 212, 214, 216, 218 may be 
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connected to one or more of the egress forwarders 222, 224, 226, 228. 
However, as mentioned above, each ingress forwarder 212, 214, 216, 218 
may only be connected to one egress forwarder 222, 224, 226, 228 at a time 
and each egress forwarder 222, 224, 226, 228 may only be connected to 
5 one ingress forwarder at a time 212, 214, 216, 218. 

The cross-bar arrangement 200 is controlled by a cross-bar controller 
240 which is connected to each ingress forwarder 212, 214, 216, 218 via 
links 242, 244 and to each egress forwarder 222, 224, 226, 228 via link 
246. The cross-bar controller 240 co-ordinates the transmission and 

10 reception of cells via links 242, 244, 246. 

Each ingress forwarder 212, 214, 216, 218 communicates traffic 
measurements and notifications for the use of the cross-bar controller 240. 
The cross-bar controller 240 allocates temporary connections between 
ingress forwarders 212, 214, 216, 218 and egress forwarders 222, 224, 226, 

1 5 228 and informs the respective forwarders accordingly for each cell cycle 
in turn. 

Figure 3 illustrates a system 300 in accordance with the present 
invention. The system 300 includes a plurality of ingress forwarders 310, a 
plurality of egress forwarders 320, a cross-bar 330, and a management card 

20 340 which includes cross-bar controller means (not shown). The ingress 
forwarders 312, 314, 316 and egress forwarders 322, 324, 326 are shown as 
line interface cards (LICs), but may be implemented in other ways. 

Although three ingress and three egress forwarders are shown, it will 
readily be appreciated that any suitable number of ingress and egress 

25 forwarders may be utilised according to the particular application. In this 
arrangement, it is not essential that the number of ingress forwarders and 
egress forwarders are the same, for example, a multiplexer may have 
sixteen ingress forwarders and only four egress forwarders. 
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Each ingress forwarder 312, 314, 316 has a buffer (not shown in 
detail) containing a plurality of data queues qu, qi2, qin> q2i, ^22, q2n, 
and q mJ , q^, . . q™, respectively (where m is the number of ingress 
forwarders 310 and n is the number of data queues) feeding into respective 
5 schedulers 352, 354, 356. 

Each queue can be generally represented as qjk where J indicates the 
ingress, k indicates the egress, and qjk represents the packet queue at the 
ingress j for packets destined for egress k. 

Each scheduler 352, 354, 356 is connected to a respective ingress 
10 timetable 362, 364, 366 which controls the selection of data queue to be 
transferred to the cross-bar 330 in accordance with data from the 
management card 340. 

Similarly, each egress forwarder 322, 324, 326 has a respective 
ingress selector 372, 374, 376 for receiving data from the cross-bar 330 and 
1 5 for transferring it to a buffer (not shown in detail) containing a plurality of 
data queues q'n, q' 12 > q'i P , ^21, q'22> q 5 2 P , and q' rl , q' r2 , q'rp, 
respectively (where r is the number of egress forwarders 320 and p is the 
number of data queues). Each ingress selector 372, 374, 376 is connected 
to a respective egress timetable 382, 384, 386 which provides information 
20 as to which ingress forwarder to select in accordance with data from the 
management card 340. Each data queues q'n, q'i2> ...» q'i P , q 5 2i> q*22, . •> 
q' 2p , and q' r i, q' r2 , q'rp is connected to a further scheduler 392, 394, 396 
which selects the appropriate queue to be output. This is, however, not 
essential to the present invention. 
25 The operation of the method according to the present invention will 

now be described using the system 300 shown in Figure 3. 

Each ingress LIC (ingress forwarder) 312, 314, 316 is associated 
with a respective schedule or timetable 362, 364, 366 governing the 
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transmission of cells by said ingress LIC. The schedule is in the form of a 
table whose entries are the identities of ingress LICs. Each ingress LIC 
maintains a pointer into the schedule. At each cell transmit time, the LIC 
transmits a cell from the queue identified in the entry referenced by the 
5 pointer, and moves the pointer to the next location. The schedule is 

circular in the sense that when moving the pointer from the last entry, its 
next position is the first entry. 

Each egress LIC (egress forwarder) 322, 324, 326 is also associated 
with a respective schedule or timetable 382, 384, 386 governing the 

10 reception of cells by said egress LIC. The schedule is in the form of a table 
whose entries are the identities of ingress LICs from which to receive. An 
egress LIC maintains a pointer into the schedule. At each cell transmit 
time, the LIC will receive the cell from the ingress identified in the entry 
referenced by the pointer, and moves the pointer to the next location. 

15 Again, the schedule is circular in the sense that when moving the pointer 
from the last entry, its next position is the first entry. 

The management card 340 manages the contents of the ingress and 
egress schedules or timetables 362, 364, 366, 382, 384, 386. For each 
ingress LIC (ingress forwarder) 312, 314, 316, the management card 340 

20 calculates the cross-bar rates required to each egress LIC (egress forwarder) 
322, 324, 326. The cross-bar rates are calculated according to the current 
traffic load and required quality of service. The rates are calculated, for 
example, as described in the co-pending application mentioned above. 
Alternatively, the rates could be fixed. 

25 Having calculated the rates, the management card 340 calculates 

corresponding ingress and egress schedules or timetables needed to satisfy 
them. It then updates the schedules or timetables using update messages 
known as configuration primitives on links 342, 344. Only modifications 
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to the schedules need be transmitted rather than the complete tables. This 
reduces the control traffic from the management card. 

On receipt of the configuration primitives, the LICs or forwarders 
update the schedules as requested. 
5 In order to avoid problems associated with updating the schedule, 

that is, reading and writing to the same point in schedule, the schedule may 
be partitioned so that reading and writing occur in different parts. 
Alternatively, two separate identical schedules may be provided - one 
schedule which is being read and another schedule which is being 
10 configured. 

This approach has the following advantages:- 

First, complex rate calculation and schedule calculation algorithms 
can be implemented in software or on programmable devices on the 
management card 340, reducing development risk and cost. 
15 Secondly, the hardware required to support the approach is very 

simple. 

Thirdly, the communication channels between the LICs 310, 320 and 
the management card 340 do not require stringent delay performance since 
the primitives are not directly synchronised to the cell transmissions across 

20 the backplane 330. 

Fourthly, the communication channels 342, 344 between the LICs 
310, 320 and the management card 340 require less bandwidth than would 
be the case if the management card were to reconfigure the cross-bar or 
backplane 330 every cell time. 

25 Fifthly, the behaviour of the switch/router can be modified by 

changes to the software algorithms running on the management card 340. 
This has two consequences. Different applications can be supported by the 
same hardware. For example, the same hardware could support both 
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internet protocol (IP) routing and circuit switching, including asynchronous 
transfer mode (ATM) and the related synchronous transfer mode (STM). 
Furthermore, optimisations and refinements to the algorithms can easily be 
implemented. 

It will be readily understood that although the preceding discussion 
has been in terms of terabit routers, the apparatus of the present invention 
are capable of implementation in a wide variety of switching devices, 
including switches and routers, and that these devices can be either purely 
electronic, part electronic/part optical or optical in nature. 



